Word and syllable models for German text-to-speech synthesis
نویسنده
چکیده
The correct pronunciation of unknown or novel words is one of the biggest challenges for text-to-speech systems. In this paper we describe the implementation of unknown word analysis as a central component of the text analysis module in the Bell Labs German text-to-speech system. The implementation is based on a model of the morphological structure of words and on the study of the productivity of word forming affixes. One important subcomponent of the word model is a phonotactic syllable model which enables the system to handle orthographic substrings that are unaccounted for by the explicitly listed morphemes. Finally, we discuss issues for future research.
منابع مشابه
مراحل و نحوه ی تهیه ی دادگان های صوتی هجایی و دایفونی برای سامانه ی تبدیل متن به گفتار فارسی
Abstract Speech databases are part of the concatenative text to speech synthesis systems. Phonetic quality of the databases plays a significant role in the naturalness of the synthesized speech. This paper introduces two syllable and diphone speech databases for Persian and investigates the way of their development and their specifications and their advantages to each other. ...
متن کاملطراحی و ارزیابی یک مدل بازسازی گفتار به روش همگذاری واحدهای حساس به بافت نوایی
This paper describes the design and evaluation of prosodically-sensitive concatenative units for a Persian text-to-speech (TTS) synthesis system. Thesyllables used are prosodically conditioned in the sense that a single conventional syllable is stored as different versions taken directly from the different prosodic domains of the prosodically labeled, read sentences. The three levels of the Per...
متن کاملThe “kiel Corpus of Read Speech” as a Resource for Prosody Prediction in Speech Synthesis
The naturalness of synthetic speech depends strongly on the prediction of appropriate prosody. For the present study the original annotation of the German speech database “Kiel Corpus of Read Speech” was extended automatically with syntactic features, word frequency, and syllable boundaries. Several classification and regression trees for predicting symbolic prosody features, postlexical phonol...
متن کاملAcoustic correlates of word stress in German spontaneous speech
The acoustic properties of word stress have been explored in a number of studies. However, there is little research on German word stress, and even less on its realization in spontaneous speech. This paper tests whether parameters that have been found to implement word stress in mostly laboratory speech are also employed in a corpus of German spontaneous speech. Specifically, we consider spectr...
متن کاملLetter-to-Phoneme Conversion for a German Text-to-Speech System
This thesis deals with the conversion from letters to phonemes, syllabification and word stress assignment for a German text-to-speech system. In the first part of the thesis (chapter 5), several alternative approaches for morphological segmentation are analysed and the benefit of such a morphological preprocessing component is evaluated with respect to the grapheme-to-phoneme conversion algori...
متن کامل